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We  analyzed  the  variation  of  eighteen  miniSTR  loci  in  411  randomly  chosen  individuals  from  Korea  to 
increase  the  probability  that  a  degraded  sample  can  be  typed,  as  well  as  to  provide  an  expanded  and 
reliable  population  database.  Six  multiplex  PCR  systems  were  developed  (multiplex  I:  D1S1677,  D2S441 
and  D4S2364;  multiplex  II:  D10S1248,  D14S1434  and  D22S1045;  multiplex  III:  D12S391,  D16S3253  and 
D20S161 ;  multiplex  IV:  D3S4529,  D8SU15  and  D18S853;  multiplex  V:  D6S1017,  D11S4463  and 
D17S1301;  multiplex  VI:  D5S2500,  D9S1122  and  D21S1437).  Allele  frequencies  and  forensic  parameters 
were  calculated  to  evaluate  the  suitability  and  robustness  of  these  non-CODIS  miniSTR  systems.  No 
significant  deviation  from  Hardy— Weinberg  equilibrium  expectations  were  observed,  except  for 
D4S2364,  D5S2500  and  D20S161  loci.  A  multidimensional  scaling  plot  based  on  allele  frequencies  of  the 
six  miniSTR  loci  (D1S1677,  D2S441,  D4S2364,  D10S1248,  D14S1434  and  D22S1045)  showed  that  Koreans 
appeared  to  have  most  genetic  affinity  with  Chinese  and  Japanese  than  to  other  Eurasian  populations 
compared  here.  The  combined  probability  of  match  calculated  from  the  18  miniSTR  loci  was  2.902  x  10 
-17,  indicating  a  high  degree  of  polymorphism.  Thus,  the  18  miniSTR  loci  can  be  suitable  for  recovering 
useful  information  for  analyzing  degraded  forensic  casework  samples  and  for  adding  supplementary 
genetic  information  for  a  variety  of  analyses  involving  closely  related  individuals  where  there  is  a  need 
for  additional  genetic  information. 

©  2013  Elsevier  Ltd  and  Faculty  of  Forensic  and  Legal  Medicine.  All  rights  reserved. 


1.  Introduction 

Autosomal  short  tandem  repeat  (STR)  markers  are  among  the 
most  powerful  tools  for  forensic  genetics  including  individual 
identity  and  paternity  testing  because  of  their  numerous  alleles, 
high  genetic  diversity  and  stable  heredity  in  the  human  genome. 
However,  many  forensic  laboratories  often  encounter  difficulties  in 
further  analyzing  a  poor-quality  sample.  Exposure  of  DNA  to  ele¬ 
ments  or  to  fire  for  a  certain  length  of  time,  degradation  can  start 
due  to  bacterial,  biochemical,  or  oxidative  processes.  Thus,  forensic 
DNA  samples  that  are  often  subjected  to  environmental  factors 
promoting  degradation  can  pose  challenges  for  data  interpretation. 
In  such  a  circumstance,  loss  of  a  signal  is  typically  observed  with  a 
larger-sized  STR  amplicon.  Due  to  the  extensive  DNA  fragmentation 
in  many  forensic  cases,  standard  STR  testing  is  often  inadequate  by 


*  Corresponding  author.  Tel.:  +82  70  7154  3441;  fax:  +82  41  559  7861. 
E-mail  address:  wookkim@dankook.ac.kr  (W.  Kim). 


using  a  commercially  available  multiplex  STR  genotyping  kit. 
Another  approach  to  recover  information  from  degraded  DNA 
samples  is  to  use  the  reduced  size  of  the  PCR  products.  Reducing  the 
size  of  the  PCR  product  (miniSTR)  by  moving  primers  in  as  close  as 
possible  to  the  STR  repeat  region  is  an  established  and  highly 
effective  method  for  improving  the  forensic  analysis  of  highly 
degraded  DNA  samples. 

Applications  of  STR  analysis  in  forensic  casework  benefit  from 
large  population  databases  for  estimating  the  probability  of  iden¬ 
tity  by  chance.  The  use  of  additional  STR  markers  would  provide 
sufficient  forensic  parameters  for  more  difficult  cases  in  paternity 
or  maternity  analyses,  such  as  deficient  cases  (i.e.,  only  the  alleged 
father  and  the  child  are  included),  missing  persons,  or  in  case  of 
mutations.  Thus,  it  is  important  that  the  STR  databases  continue  to 
be  expanded  and  made  more  reliable  to  provide  a  better  tool  for 
forensic  analysis.  National  DNA  databases  initiated  in  Korea  in  2010 
comprise  the  13  CODIS  markers  in  the  United  States  as  well  as  the 
well-known  D2S1338,  D19S433,  Penta  E  and  Penta  D  loci.  Many 
forensic  communities  have  proposed  the  inclusion  of  additional 
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loci,  since  the  potential  false  matches  with  a  large  number  of 
comparisons  being  made  within  and  between  databases.  Thus, 
the  analysis  of  extended  non-CODIS  STR  markers  may  be  a  poten¬ 
tially  powerful  tool  for  forensic  analyses  in  the  Korean  population. 

In  this  study,  we  present  the  distribution  of  allelic  frequencies 
and  forensic  efficiency  parameters  of  18  non-CODIS  miniSTR  loci 
(Supplementary  Table  SI;  multiplex  I:  D1S1677,  D2S441  and 
D4S2364;  multiplex  II:  D10S1248,  D14S1434  and  D22S1045; 
multiplex  III:  D12S391,  D16S3253  and  D20S161;  multiplex  IV: 
D3S4529,  D8S1115  and  D18S853;  multiplex  V:  D6S1017,  D11S4463 
and  D17S1301 ;  multiplex  VI:  D5S2500,  D9S1122  and  D21S1437) 
from  411  randomly  selected  individuals  in  Korea. 

2.  Materials  and  methods 

2.1.  Sample  collection  and  DNA  extraction 

Blood  or  buccal  swab  samples  were  collected  from  411  unrelated 
healthy  individuals  in  Cheonan,  Korea.  The  familial  history  of  par¬ 
ticipants  was  obtained  to  exclude  their  relatives  from  the  analysis. 
The  surname  was  not  considered,  because,  proportions  of  top  five 
surnames  were  reached  to  60%  among  more  than  200  surnames  in 
Korea.  A  separate  written  informed  consent  was  obtained  from  all 
the  donors  before  collecting  their  blood  or  buccal  swab.  Genomic 
DNA  was  extracted  from  whole  blood  or  buccal  cells  by  the  QIAamp 
DNA  Mini  Kit  (Qiagen,  Hilden,  Germany).  The  DNA  samples  were 
quantified  using  the  NanoDrop  2000  Spectrophotometer  (Thermo 
Scientific,  Wilmington,  DE,  USA). 

2.2.  PCR  amplification 

For  genotyping  of  the  18  miniSTR  loci  (Supplementary  Table  SI ), 
six  triplex  PCRs  were  performed  using  the  primer  sets  mentioned 
in  previous  studies.  All  these  loci  are  located  on  different 
chromosomes  and  D2S441  and  D22S1045  are  included  in  the 
Extended  European  Standard  Set  (ESS).  Several  fluorescent  dyes  for 
primer  labeling  were  modified  to  accommodate  the  subsequent 
PCR  fragment  detection  in  capillary  electrophoresis.  The  forward 
primers  for  D4S2364,  D10S1248,  D16S3253,  D8S1115,  D6S1017  and 
D5S2500  were  labeled  with  FAM.  The  primers  for  D2S441, 
D14S1434,  D20S161,  D18S853,  D11S4463  and  D9S1122  were 
labeled  with  HEX.  The  primers  for  D1S1677,  D22S1045,  D12S391, 
D3S4529,  D17S1301  and  D21S1437  were  labeled  with  NED 
(Supplementary  Table  S2).  All  primer  sets  were  designed  to  amplify 
the  PCR  products  with  a  maximum  fragment  size  of  <145  bp.  Each 
multiplex  PCR  reaction  was  performed  using  a  total  volume  of  10  pi 
containing  1  ng  genomic  DNA,  1  x  PCR  Gold  buffer  (Applied  Bio¬ 
systems,  Foster  City,  CA,  USA),  1.0  U  AmpliTaq  Gold®  DNA  poly¬ 
merase  (Applied  Biosystems),  1.5  mM  MgC^,  200  pM  of  each  dNTP, 
and  a  primer  concentration  adjusted  to  balance  the  peak  height. 
Thermal  cycling  was  conducted  on  the  GeneAmp®  PCR  System 
9700  (Applied  Biosystems)  with  slight  modification  in  the  PCR 
condition.  Initial  denaturation  was  performed  at  95  °C  for  10  min, 
followed  by  30  cycles  of  94  °C  for  30  s,  55  °C  (except  for  multiplex 
III:  58  °C;  multiplex  IV:  64  °C;  and  multiplex  V,  and  VI:  61  °C)  for 
30  s,  72  °C  for  45  s,  and  a  final  extension  of  60  °C  for  45  min. 

2.3.  Genotyping 

For  the  genotyping,  the  PCR  products  were  mixed  with  the 
GeneScan™  500  ROX™  Size  Standard  (Applied  Biosystems).  Capil¬ 
lary  electrophoresis  was  performed  on  the  ABI  PRISM™  310  Genetic 
Analyzer  (Applied  Biosystems).  The  results  were  analyzed,  and 
allele  designations  were  determined  by  the  GeneMapper  4.1  soft¬ 
ware  (Applied  Biosystems)  with  sequenced  allelic  ladders.  The 


allelic  ladders  for  multiplexes  I-VI  were  created  using  a  combina¬ 
tion  of  individual  templates,  which  represent  the  range  for  alleles 
observed  by  Han  et  al.  and  in  this  study.  At  least  two  different 
homozygous  samples  for  each  miniSTR  marker  were  sequenced  to 
confirm  the  allele  nomenclature  by  a  procedure  described  previ¬ 
ously.  Commercial  DNA  standard  9947 A  and  9948  (Promega, 
Madison,  WI,  USA)  were  genotyped  as  the  positive  standard  refer¬ 
ence  for  quality  control.  In  addition,  dd^O  was  used  as  the  nega¬ 
tive  control.  A  concordant  study  was  performed  to  ensure 
reproducible  and  accurate  genotyping  by  a  re-genotyping  proce¬ 
dure  by  using  more  than  20  samples  for  each  multiplex. 

2.4.  Statistical  analyses 

General  goodness-of-fit  test  of  x2-test  were  used  to  assess 
Hardy-Weinberg  equilibrium  (HWE).  Possible  divergence  from 
HWE  was  also  examined  by  the  Exact  Test.  These  statistical  ana¬ 
lyses  were  performed  with  the  PowerMarker  v3.25  and  Arlequin 
v3.5.  The  genetic  distance  was  estimated  by  shared  allele  genetic 
distance  (Dsa).  The  genetic  distance  values  were  also  displayed  as 
a  multidimensional  scaling  (MDS)  plot  by  using  SPSS  statistics  19 
(SPSS  Korea,  Seoul,  Korea).  Important  parameters  of  forensic  pa¬ 
ternity  testing  were  calculated  using  PowerStats  vl.2. 

3.  Results  and  discussion 

We  analyzed  six  miniSTR  multiplex  systems  for  the  typing  of  18 
non-CODIS  loci.  None  were  located  on  the  same  chromosome.  We 
assessed  the  statistical  parameters  for  applications  to  forensic  and 
population  genetic  studies  by  using  these  18  miniSTR  loci  from  the 
Korean  samples  surveyed  in  this  study.  The  distributions  of  allelic 
frequencies  and  forensic  parameters  of  the  18  minSTR  loci  from  the 
411  Koreans  are  summarized  in  Table  1.  The  exact  tests  were  per¬ 
formed  on  all  18  miniSTR  loci  to  determine  HWE  and  the  p-values 
are  listed  in  Table  1.  No  significant  deviation  from  the  HWE  ex¬ 
pectations  was  observed,  except  for  D4S2364,  D5S2500  and 
D20S161  loci.  After  Bonferroni  correction,  these  three  loci  were  still 
significant  in  the  HWE  test  (p  <  0.00278).  Deviation  from  HWE  of 
D4S2364  in  Korean  subjects  and  D5S2500  in  Chinese  subjects  has 
been  previously  reported,  but  deviation  of  the  D20S161  locus 
has  not  been  reported.  Additional  studies  with  larger  sample  sizes 
are  necessary  to  understand  the  cause  of  deviation  from  the  equi¬ 
librium  model  at  the  three  loci  surveyed  in  this  study. 

Based  on  our  result  of  allele  distributions  of  18  STR  loci,  the 
number  of  alleles  varied  from  four  at  D4S2364  to  12  at  D8S1115 
(Supplementary  Table  SI).  The  observed  heterozygosity  (H0bs)  was 
greater  than  0.7  for  12  out  of  18  loci.  Of  the  18  loci,  D11S4463  was 
the  most  informative  miniSTR  locus  (H0bs  value  at  0.7835),  whereas 
D5S2500  was  the  least  informative  miniSTR  locus  (H0bs  value  at 
0.6010).  Individual  probability  of  match  (PM)  values  ranged  from 
0.0597  at  D12S391  to  0.2318  at  D4S2364.  The  power  of  discrimi¬ 
nation  (PD)  varied  from  0.7682  at  D4S2364  to  0.9403  at  D12S391. 
The  paternity  power  of  exclusion  (PPE)  ranged  from  0.3682  at 
D2S441  to  0.5677  at  D11S4463.  The  combined  PM  calculated  from 
the  18  minSTR  loci  studied  was  2.902  x  10~17,  and  the  PPE  was 
estimated  to  be  0.9999871.  In  contrast,  Coble  et  al.  reported  two 
miniplex  systems  called  NC01  (D10S1248,  D14S1434  and 
D22S1045)  and  NC02  (D1S1677,  D2S441  and  D4S2364),  showing 
that  combined  PM  and  PPE  from  the  six  miniSTR  loci  were 
7.409  x  10-6  and  0.974,  respectively.  The  authors  proposed  the 
inclusion  of  further  or  different  STR  loci  to  satisfy  the  increasing 
requirements  of  forensic  parameters  for  obtaining  additional  in¬ 
formation.  Thus,  our  results  indicate  that  these  18  miniSTR  loci 
have  relatively  higher  probability  of  exclusion  as  well  as  PD  and 
therefore  can  be  useful  in  forensic  analysis. 
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Table  1 

Allele  frequencies  and  statistical  parameters  for  eighteen  miniSTR  loci  in  the  Korean  population  (n  =  411). 


Allele 

D1S1677 

D2S441 

D3S4529 

D4S2364 

D5S2500 

D6S1017 

D8S1115 

D9S1122 

D10S1248 

6 

7 

0.034 

8 

0.174 

0.268 

0.001 

0.007 

9 

0.001 

0.470 

0.173 

0.004 

10 

0.007 

0.181 

0.322 

0.328 

0.001 

0.091 

11 

0.002 

0.466 

0.026 

0.006 

0.147 

0.004 

11.3 

0.017 

12 

0.007 

0.170 

0.004 

0.002 

0.296 

0.303 

0.072 

13 

0.120 

0.056 

0.165 

0.073 

0.001 

0.369 

0.387 

14 

0.467 

0.100 

0.302 

0.370 

0.007 

0.023 

0.067 

0.238 

15 

0.331 

0.009 

0.322 

0.002 

0.002 

0.038 

0.012 

0.206 

16 

0.058 

0.165 

0.001 

0.477 

0.079 

17 

0.006 

0.041 

0.345 

0.217 

0.015 

18 

0.251 

0.050 

19 

0.001 

0.012 

20 

0.024 

0.001 

21 

22 

23 

0.002 

24 

25 

Hobs3 

0.6618 

0.6594 

0.7348 

0.7202 

0.6010 

0.7518 

0.6618 

0.7129 

0.7664 

H  b 

°exp 

0.6542 

0.7076 

0.7486 

0.6441 

0.6804 

0.7270 

0.6912 

0.7377 

0.7396 

X2-test  (P) 

1.0000 

0.0006 

0.5333 

0.0001 

0.0000 

0.8019 

0.9991 

0.0001 

0.6280 

Exact  test  (P) 

0.9267 

0.0682 

0.5673 

0.0000 

0.0000 

0.5470 

0.4185 

0.2282 

0.5482 

PM 

0.1845 

0.1229 

0.1063 

0.2318 

0.1616 

0.1324 

0.1376 

0.1125 

0.1104 

PDe 

0.8155 

0.8771 

0.8937 

0.7682 

0.8384 

0.8676 

0.8624 

0.8875 

0.8896 

PICf 

0.5945 

0.6712 

0.7059 

0.5778 

0.6112 

0.6771 

0.6501 

0.6963 

0.6992 

PPEg 

0.3717 

0.3682 

0.4841 

0.4602 

0.2885 

0.5119 

0.3705 

0.4485 

0.5383 

Allele 

D11S4463 

D12S391 

D14S1434 

D16S3253 

D17S1301 

D18S853 

D20S161 

D21S1437 

D22S1045 

6 

0.068 

7 

0.001 

0.007 

8 

0.146 

0.007 

0.001 

9 

0.007 

0.060 

0.040 

10 

0.001 

0.120 

0.336 

0.079 

0.002 

0.089 

11 

0.004 

0.153 

0.353 

0.189 

0.376 

0.125 

0.190 

11.3 

12 

0.049 

0.030 

0.012 

0.405 

0.062 

0.019 

0.002 

13 

0.195 

0.243 

0.012 

0.213 

0.264 

0.039 

0.001 

14 

0.314 

0.436 

0.011 

0.049 

0.218 

0.524 

0.005 

15 

0.279 

0.019 

0.013 

0.011 

0.075 

0.045 

0.161 

0.299 

16 

0.124 

0.007 

0.004 

0.001 

0.162 

0.035 

0.251 

17 

0.027 

0.102 

0.412 

0.006 

0.227 

18 

0.001 

0.297 

0.002 

0.155 

0.021 

19 

0.240 

0.135 

0.004 

20 

0.153 

0.082 

21 

0.090 

0.009 

22 

0.051 

0.001 

23 

0.026 

24 

0.013 

25 

0.001 

Hobs3 

0.7835 

0.7421 

0.6837 

0.7202 

0.7762 

0.7324 

0.7518 

0.6740 

0.7810 

LJ  b 

nexp 

0.7674 

0.8085 

0.7120 

0.7329 

0.7445 

0.7320 

0.7529 

0.6725 

0.7594 

x2-test  (P) 

0.1527 

0.5570 

0.9168 

0.8198 

0.9032 

0.9431 

0.0036 

0.3587 

0.5329 

Exact  test  (P) 

0.1113 

0.3350 

0.7806 

0.1822 

0.7000 

0.7811 

0.0006 

0.3368 

0.0653 

PM 

0.0951 

0.0597 

0.1223 

0.1189 

0.1063 

0.1143 

0.0971 

0.1397 

0.1044 

PDe 

0.9049 

0.9403 

0.8777 

0.8811 

0.8937 

0.8857 

0.9029 

0.8603 

0.8956 

PICf 

0.7294 

0.7837 

0.6693 

0.6902 

0.7096 

0.6873 

0.7224 

0.6403 

0.7174 

PPEg 

0.5677 

0.4963 

0.4035 

0.4602 

0.5556 

0.4801 

0.5129 

0.3880 

0.5643 

a  Hobs:  observed  heterozygosity. 
b  Hexp:  expected  heterozygosity. 
c  Exact  test  (Monte  Carlo  method). 
d  PM,  probability  of  match. 
e  PD,  power  of  discrimination. 
f  PIC,  polymorphism  information  content. 
g  PPE,  paternity  power  of  exclusion. 


A  population  comparison  based  on  the  Dsa  values  calculated 
from  the  allele  frequencies  of  the  six  miniSTR  loci  in  NC01  and  NC02 
systems  between  the  Korean  population  and  other  published 
populations  is  shown  in  Supplementary  Table  S3.21-30  An  MDS  plot 


of  Dsa  based  on  the  results  presented  in  Supplementary  Table  S3  is 
depicted  in  Fig.  1.  The  plot  shows  two  distinct  clusters  (East  Asians 
and  African/Europeans/Americans),  one  intermediate  population 
(Indian-Singaporean)  and  one  out-population  (Brazilian  Mulatto). 
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Han  Chinese  □  , 
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Korean  Chinese  □  D 
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by  National  Research  Institute  of  Cultural  Heritage  of  Cultural 
Heritage  Administration,  Republic  of  Korea. 

Appendix  A.  Supplementary  data 

Supplementary  data  related  to  this  article  can  be  found  at  http:// 
dx.doi.org/10.1016/j.jflm.2013.09.027. 
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Fig.  1.  Multidimensional  scaling  plot  of  shared  allele  genetic  distance  (Ds a)  based  on 
the  results  presented  in  Supplementary  Table  S3  (stress  =  0.17).  Korean  population  is 
represented  by  diamond,  Asians  by  squares,  Europeans  by  triangles,  Americans  by 
crosses,  and  Africans  by  circles. 


As  expected,  Koreans  appeared  to  have  the  closest  genetic  affinity 
with  Chinese  and  Japanese  (Fig.  1 ).  This  result  was  consistent  with  a 
previous  report  derived  from  the  datasets  of  mitochondrial  DNA 
and  Y  chromosome  markers.  Indian  Singaporeans  were  located  as 
an  intermediate  between  the  East  Asian  cluster  and  others.  It  has 
been  reported  that  peopling  of  the  Indian  subcontinent  is  charac¬ 
terized  by  a  complex  history,  with  contributions  from  different 
ancestral  populations.  The  Brazilian  Mulatto  population  was 
isolated  from  the  main  cluster  of  the  MDS  plot.  This  may  reflect 
their  mixed  ethnicity,  because  Mulattos  are  descendants  of  Euro¬ 
pean  and  African  ancestry/lineage. 

In  summary,  our  study  demonstrates  that  the  18  non-CODIS 
minSTR  loci  are  highly  polymorphic.  Additional  these  miniSTR 
markers,  together  with  the  conventional  STRs  of  CODIS,  multi-locus 
commercial  kits  (AmpFISTR®  Identifier®  and  Powerplex®  16)  or 
miniSTR  commercial  kit  (AmpFISTR®  Minifiler®)  may  help  resolve 
more  complex  forensic  cases.  Thus,  the  18  miniSTR  markers  can 
serve  as  useful  complements  to  other  commercial  STR  systems  for 
adding  supplementary  genetic  information  not  only  for  the  typing 
of  degraded  samples  but  also  for  a  variety  of  analyses  involving 
closely  related  individuals  when  additional  genetic  information  is 
required.  Further  surveys  of  the  miniSTR  systems  to  obtain  addi¬ 
tional  evidence-based  data  would  provide  further  insights. 
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